University of Essex at the TAC 2011 MultiLingual Summarisation Pilot
نویسندگان
چکیده
We present the results of our Arabic and English runs at the TAC 2011 Multilingual summarisation (MultiLing) task. We participated with centroid-based clustering for multidocument summarisation. The automatically generated Arabic and English summaries were evaluated by human participants and by two automatic evaluation metrics, ROUGE and AutoSummENG. The results are compared with the other systems that participated in the same track on both Arabic and English languages. Our Arabic summariser performed particularly well in the human evaluation.
منابع مشابه
Creating language resources for under-resourced languages: methodologies, and experiments with Arabic
Language resources are important for those working on computational methods to analyse and study languages. These resources are needed to help advancing the research in fields such as natural language processing, machine learning, information retrieval and text analysis in general. We describe the creation of useful resources for languages that currently lack them, taking resources for Arabic s...
متن کاملUsing SUMMA for Language Independent Summarization at TAC 2011
The paper describes a language independent multi-document centroid-based summarization system. The system has been evaluated in the 2011 TAC Multilingual Summarization pilot task where summaries were automatically produced for document clusters in Arabic, English, French and Hindi. The system had a reasonable performance in content selection for languages such as Arabic and Hindi and medium per...
متن کاملLIF at TAC MultiLing: Towards a Truly Language Independent Summarizer
This paper presents the LIF system for the TAC’2011 Multilingual pilot track. We followed a language-independent approach to summarization for this task. In particular, we tried to remove the following dependences to language: sentence segmentation, word segmentation, stop-word lists, and word-level relevance assessment. We applied these modifications to an MMR-based system and observed little ...
متن کاملJRC's Participation at TAC 2011: Guided and MultiLingual Summarization Tasks
The paper describes our participation in the Guided and Multilingual Summarization Tasks at the Text Analysis Conference 2011 (TAC’11). We participated in the Guided task with the system from the previous year which combines aspect identification by an event extraction system and automatically learned lexicons with LSA-based summarizer. This year we included temporal analysis to improve sentenc...
متن کاملFinding the Best Approach for Multi-lingual Text Summarisation: A Comparative Analysis
This paper addresses the problem of multilingual text summarisation. The goal is to analyse three approaches for generating summaries in four languages (English, Spanish, German and French), in order to determine the best one to adopt when tackling this issue. The proposed approaches rely on: i) language-independent techniques; ii) language-specific resources; and iii) machine translation resou...
متن کامل